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Abstract — The problem of stabilizing an unstable plant over a 
noisy communication link is an increasingly important one that 
arises in problems of distributed control and networked control 
systems. Although the work of Schulman and Sahai over the past 
two decades, and their development of the notions of "tree codes" 
and "anytime capacity", provides the theoretical framework for 
studying such problems, there has been scant practical progress in 
this area because explicit constructions of tree codes with efficient 
encoding and decoding did not exist. To stabilize an unstable plant 
driven by bounded noise over a noisy channel one needs real-time 
encoding and real-time decoding and a reliability which increases 
exponentially with delay, which is what tree codes guarantee. We 
prove the existence of linear tree codes with high probability and, 
for erasure channels, give an explicit construction with an expected 
encoding and decoding complexity that is constant per time instant. 
We give sufficient conditions on the rate and reliability required 
of the tree codes to stabilize vector plants and argue that they 
are asymptotically tight. This work takes a major step towards 
controlling plants over noisy channels, and we demonstrate the 
efficacy of the method through several examples. 

I. Introduction 

In control theory, the output of a dynamical system is 
observed and a controller is designed to regulate its behavior. 
The controller needs to react and generate control signals in 
real-time. In most traditional control systems, the controller 
and the plant are colocated and hence there is no measure- 
ment loss. There are increasingly many applications such as 
networked control systems [1] and distributed computing [2] 
where systems are remotely controlled and where measurement 
and control signals are transmitted across noisy channels. This 
necessitates a need to reliably communicate the measurement 
and control signals by correcting for the errors introduced by the 
channels. Although Shannon's information theory is concerned 
with reliable transmission of a message from one point to 
another over a noisy channel, the reliability is achieved at 
the price of large delays which may lead to instability when 
they occur in the feedback loop of a control system. Hence, 
we need practical real-time encoding and decoding schemes 
with appropriate reliability for controlling systems over lossy 
networks. 

Consider a control system with a single observer that com- 
municates with the controller over a lossy communication 
channel and where the feedback link from the controller to 
the plant is noiseless. When the channel is rate-limited and 
deterministic, significant progress has been made (see eg., [3], 
[4]) in understanding the bandwidth requirements for stabilizing 
open loop unstable systems. When the communication channel 
is stochastic, [5] provides a necessary and sufficient condition 
on the communication reliability needed over such a channel 



to stabilize an unstable scalar linear process, and proposes the 
notion of feedback anytime capacity as the appropriate figure of 
merit for such channels. In essence, the encoder is causal and 
the probability of error in decoding a source symbol that was 
transmitted d time instants ago should decay exponentially in 
the decoding delay d. 

Although the connection between communication reliability 
and control is clear, very little is known about error-correcting 
codes that can achieve such reliabilities. Prior to the work 
of [5], and in a different context, [2] proved the existence 
of codes which under maximum likelihood decoding achieve 
such reliabilities and referred to them as tree codes. Note 
that any real-time error correcting code is causal and since it 
encodes the entire trajectory of a process, it has a natural tree 
structure to it. [2] proves the existence of nonlinear tree codes 
yet gives no explicit constructions and/or efficient decoding 
algorithms. Much more recently [6] proposed efficient error 
correcting codes for unstable systems where the state grows only 
polynomially large with time. So, for linear unstable systems 
that have an exponential growth rate, all that is known in the 
way of error correction is the existence of tree codes which 
are, in general, non-linear. Moreover, the existence results are 
not with a "high probability". When the state of an unstable 
scalar linear process is available at the encoder, [7] and [8] 
develop encoding-decoding schemes that can stabilize such a 
process over the binary symmetric channel and the binary 
erasure channel respectively. But little is known in the way 
of stabilizing partially observed vector-valued processes over 
a stochastic communication channel. 

The subject of error correcting codes for control is in its 
relative infancy, much as the subject of block coding was after 
Shannon's seminal work in [9]. So, a first step towards realizing 
practical encoder-decoder pairs with anytime reliabilities is to 
explore linear encoding schemes. We consider rate R = — 
causal linear codes which map a sequence of A: -dimensional 
binary vectors {b T }^° =0 to a sequence of n— dimensional binary 
vectors {c T }^ where c t is only a function of {& r }*_ . Such 
a code is anytime reliable if there exist constants j3 > 0, r\ > 
and a delay d a > such that at all times t, P\bt-d\t 7^ b t -d) < 

The contributions of this paper are as follows: 1. We show 
that linear tree codes exist and further, that they exist with a 
high probability. 2. For the binary erasure channel, we propose 
a maximum likelihood decoder whose average complexity of 
decoding is constant per each time iteration and for which the 
probability that the complexity at a given time t exceeds KC 3 
decays exponentially in C. 3. We also prove asymptotically tight 
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Fig. I. Causal encoding and decoding 



sufficient conditions on the rate R and exponent j3 needed to 
stabilize vector-valued processes over a noisy channel. As a 
consequence, we can efficiently stabilize a partially observed 
unstable linear process over a binary erasure channel without 
any channel feedback. 

In Section II, we introduce the notation and set up the 
problem. In Section III, we introduce the ensemble of time 
invariant codes and show that they are anytime reliable with 
a high probability. In Section IV, we present a simple decoding 
algorithm for the BEC and in Section V, we derive sufficient 
conditions for stabilizing unstable linear systems over noisy 
channels. We present some simulations in Section VIII to 
demonstrate the efficacy of the decoding algorithm. 

II. Problem Setup 
We will begin by introducing some notation 

1) For any matrix F,F = abs(F), i.e., F id = \F itj \.V i,j 

2) X(F) = largest eigen value of F in magnitude. 

3) For a vector x, xW denotes the i th component of x. 

4) l m = [1, . . . , 1] T , i.e., a column with to l's. 

5) For i«,d6 R m , w ^ v denotes component-wise inequality. 
Consider the following to— dimensional unstable linear system 
with scalar measurements. Assuming that the system is observ- 
able, without loss of generality, it can be cast in the following 
canonical form. 



x t +i = Fx t + Bu t + w t , y t = Hx t 
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where \(F) > 1, u t is the control input and, Wt and v t 
are bounded process and measurement noise variables, i.e., 

IKHoo < ¥ and \\vtWoo < 



^. Note that the characteristic 
polynomial of F is z n + a\z n ~ l + . . . + a m . 

The measurements {y t } are made by an observer while the 
control inputs {u t } are applied by a remote controller that is 
connected to the observer by a noisy communication channel. 
Naturally, the measurements yo-.t-i will need to be encoded 
by the observer to provide protection from the noisy channel 
while the controller will need to decode the channel outputs 
to estimate the state x t and apply a suitable control input Ut- 
This can be accomplished by employing a channel encoder at 
the observer and a decoder at the controller. For simplicity, we 



will assume that the channel input alphabet is binary. Suppose 
one time step of system evolution in (1) corresponds to n 
channel uses 1 . Then, at each instant of time t, the operations 
performed by the observer, the channel encoder, the channel 
decoder and the controller can be described as follows. The 
observer generates a k— bit message, b t e {0, l} fe , that is a 
causal function of the measurements, i.e., it depends only on 
ya-t- Then the channel encoder causally encodes &o:t £ {0, l} fc * 
to generate the n channel inputs ct <E {0, 1}™. Note that the 
rate of the channel encoder is R = k/n. Denote the n channel 
outputs corresponding to ct by z t E Z n , where Z denotes the 
channel output alphabet. Using the channel outputs received so 
far, i.e., Zou € Z nt , the channel decoder generates estimates 
{6 r | t } r < t of {b T } T <t, which, in turn, the controller uses to 
generate the control input ut+i- This is illustrated in Fig. 1. 
Note that we do not assume any channel feedback. Now, define 



P t e id = P[min{r:b rlt ^b T } = t 



1 



Thus, P ( e d is the probability that the earliest error is d steps in 
the past. 

Definition 1 (Anytime reliability): We say that an encoder- 
decoder pair is (R, /3, d )— anytime reliable if 



PL < V* 



-n/3d 



V t,d> d n 



(2) 



In some cases, we write that a code is (R, ft)— anytime reliable. 
This means that there exists a fixed d Q > such that the code 
is (R, (3, d Q )— anytime reliable. 

We will show in Section V (Theorem 5.1) that 
(i?, (3)— anytime reliability is a sufficient condition to stabilize 
(1) in the mean squared sense 2 . In what follows, we will 
demonstrate causal linear codes which under maximum 
likelihood decoding achieve such exponential reliabilities. 

III. Linear Anytime Codes 

As discussed earlier, a first step towards developing practical 
encoding and decoding schemes for automatic control is to study 
the existence of linear codes with anytime reliability. We will 
begin by defining a causal linear code. 

Definition 2 (Causal Linear Code): A causal linear code is 
a sequence of linear maps f T : {0, l} fcr h- » {0, 1}", r > and 
hence can be represented as 



/ T (6 liT ) = Grl&l + G r2 b 2 



G TT b T 



(3) 



I nxfc 



where G tj £ {0, 
We denote c T = /t(&1:t)- Note that a tree code is a more 
general construction where f T need not be linear. Also note 
that the associated code rate is R = -. The above encoding 
is equivalent to using a semi-infinite dimensional block lower 
triangular generator matrix, G n ,R, whose entries are clear from 
(3) or equivalently as a semi-infinite dimensional block lower 
triangular parity check matrix, M. n ^ (the parity check matrix 

'in practice, the system evolution in (1) is obtained by discretizing a 
continuous time differential equation. So, the interval of discretization could 
be adjusted to correspond to an integer number of channel uses, provided the 
channel use instances are close enough. 

2 can be easily extended to any other norm 



satisfies H n ^ R G„ tR = 0.) 
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H22 
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where 3 Hij <G {0, l} nx ™ and n = n(l — R). In order to ensure 
that the code rate is equal to the design rate R = fi needs 

to be full rank for every t, where fl is the nt x ni leading 
principal minor of M n ^ R . This will happen if Ha is full rank 
for all i. The existence results that follow imply the existence 
of anytime reliable M. n<R whose code rate is same as the design 
rate. 

We will present all our results for binary input output symmet- 
ric channels 4 . The Bhattacharya parameter £ for such channels 
is defined as 

00 

(= J Vp(z\X = 1)p(z\X = 0)dz 

—00 

where z and X denote the channel output and input, respec- 
tively. We will begin by proving the existence of such codes 
that are (R, f3) — anytime reliable over a finite time horizon, T, 
i.e., under ML decoding P% t < rftr pd , V t < T. We will 
then prove their existence for all time. Due to space limitations, 
proofs for all the results in this section are presented in a 
companion paper, [10]. 

A. Finite Time Horizon 

Over a finite time horizon, T, a causal linear code is 
represented by a block lower triangular parity check matrix 
Un,R,T € {0, i}™ rxnT . The following Theorem guarantees the 
existence of a M. n ^T that is (R, (3) — anytime relable. 

Theorem 3.1: For each time T > 0, rate R and exponent (3 
such that 



R< l-log 2 (l + C) 
(3<H~\l-R) (log 2 



and 
1 

c 
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there exists a causal linear code H(n,k,T) that is 
(R, (3) — anytime reliable. 

H^ 1 (1 — R) is the smaller root of the equation H (x) = 1 — R, 
where H(.) is the binary entropy function. Theorem 3.1 proves 
the existence of finite dimensional causal linear codes, H n jj j, 
that are anytime reliable for decoding instants upto time T. 
In the following subsection, we demonstrate the existence of 
semi-infinite causal linear codes, H„ ifl , that are anytime reliable 
for all decoding instants. We also show that such codes drawn 
from an appropriate ensemble are anytime reliable with a high 
probability. The key is to impose a Toeplitz structure on the 
parity check matrix. 

3 While for a given generator matrix, the parity check matrix is not unique, 
when G nj _fj is block lower, it is easy to see that H nj ij can also be chosen to 
be block lower. 

4 which can be easily extended to more general memoryless channels 



B. Time Invariant Codes 

Consider causal linear codes with the following Toeplitz 



structure 



B x 

H 2 Hi 

H T H T ^i . . . Hi 



The superscript TZ in Uj t z R denotes 'Toeplitz'. B^ Z R is ob- 
tained from M n ji in (4) by setting H^ = Hi_j + i for i > j. 
Due to the Toeplitz structure, we have the following invariance, 
P t e d = Pf, d for all t 7 t'. The notion of time invariance is 
analogous to the convolutional structure used to show the 
existence of infinite tree codes in [2]. The code M^ Z R will 
be referred to as a time-invariant code. This time invariance 
obviates the need to union bound over all time t and hence 
allows us to prove that such codes which are anytime reliable 
are abundant. 

Definition 3 (The ensemble TZ p ): The ensemble TZ p of 
time-invariant codes, is obtained as follows, Hi is any 

full rank binary matrix and for r > 2, the entries of H T are 
chosen i.i.d according to Bernoulli(p), i.e., each entry is 1 with 
probability p and otherwise. 

Note that Hi being full rank implies that H l n R is full rank for 
every t. For the ensemble TZ p , we have the following result 

Theorem 3.2 (Abundance of time-invariant codes): For any 
rate R and exponent f3 such that 



R < 1 



log 2 (l + C) 



and 



Iog 2 (l/(l-p))' 
(3 < H-\l-R) (log 2 +log 2 [(l -pT (1 - R) 1]) 

if M^ Z R is chosen from TZ p , then 

P (Ml z R is (R, (3, d ) - anytime reliable) > 1 - 2^™^ 

Note that by choosing p small, we can trade off better rates 
and exponents with sparser parity check matrices. Note that for 
BEC(e), C = e and for BSC(e), C = 2-^(1 - e). For the Binary 
Symmetric Channel (BSC) with bit flip probability e and for 
p = 5, the threshold for rate in Theorem 3.2 becomes R < 
1—2 log 2 (\/e+\/l — e )- It turns out that this can be strengthened 
as follows. 

Theorem 3.3 (Tighter bounds for BSC(e)): For any rate R 
and exponent (3 such that 

R<1- H(e), f3<KL (i2" _1 (l - R)\\ min{e, 1 - e}) 

if M^ Z R is chosen from TZ 1 , then 

P (UI Z R is (R, (3, d ) - anytime reliable) > 1 - 2^™^ 

IV. Decoding over the BEC 

Owing to the simplicity of the erasure channel, it is possible to 
come up with an efficient way to perform maximum likelihood 
decoding at each time step. We will show that the average 
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complexity of the decoding operation at any time t is constant 
and that it being larger than K C 3 decays exponentially in C. 
Consider an arbitrary decoding instant t, let c = [cj , 
be the transmitted codeword and let z — [zf 
the corresponding channel outputs. Recall that R denotes 
the fit x nt leading principal minor of H„ ^. Let z e denote 
the erasures in z and let H e denote the columns of R that 
correspond to the positions of the erasures. Also, let z e denote 
the unerased entries of z and let H e denote the columns of 
R excluding H e . So, we have the following parity check 
condition on z e , H e z e — H e z e . Since z e is known at the 
decoder, s = H e z e is known. Maximum likelihood decoding 
boils down to solving the linear equation H e z e = s. Due 
to the lower triangular nature of H e , unlike in the case of 
traditional block coding, this equation will typically not have 
a unique solution, since H e will typically not be full rank. 
This is alright as we are not interested in decoding the entire 
z e correctly, we only care about decoding the earlier entries 
accurately. If z e = [zj lt z^^Y \ then z e,i corresponds to the 
earlier time instants while z e 2 corresponds to the latter time 
instants. The desired reliability requires one to recover z ej i with 
an exponentially smaller error probability than z e 2 . Since H e 
is lower triangular, we can write H e z e = s as 



(6) 



Let H^22 denote the orthogonal complement of i? e . 22 , ie-, 
^e"22^e,22 = 0. Then multiplying both sides of (6) with 



H e .n 




Ze,l 




Si 


He,21 H e ^2 




Ze.2 
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diag(/,if e ,22), we get 



-He, 11 
^22^,21 





Si 


Ze.l = 





(7) 



If [-ffjn (-ff e L 22-^e.2i) T ] T has full column rank, then z e .i 
can be recovered exactly. The decoding algorithm now sug- 
gests itself, i.e., find the smallest possible H e ^2 such that 
Weil (-^e L 22^e,2i) T ] T has full rank and it is outlined in 
Algorithm 1. 

Algorithm 1 Decoder for the BEC 

1) Suppose, at time t, the earliest uncorrected error is at a 
delay d. Identify z e and H e as defined above. 

2) Starting with d' = 1,2, ... ,d, partition 



,T T IT 
'e.l Z e 2\ 



and H e 



H e ,u 

H e ,21 H e 22 



where z e 2 correspond to the erased positions up to delay 
d'. 

H e .il 



3) Check whether the matrix ± ' has full column 

L - H e,22- H e,21 

rank. 

4) If so, solve for z e .i in the system of equations 

#e, 11 
^22-^,21 

5) Increment t = t + 1 and continue. 





Si 


Z e ,l = 


_ H e,22 s 2 



A. Complexity 

Suppose the earliest uncorrected error is at time t—d+1, then 
steps 2), 3) and 4) in Algorithm 1 can be accomplished by just 
reducing H e into the appropriate row echelon form, which has 
complexity 0(d 3 ). The earliest entry in z e is at time t — d+1 
implies that it was not corrected at time t — 1, the probability of 
which is PjLj (_j < rj2^ nl) ^ 1 \ Hence, the average decoding 
complexity is at most KJ2d>a d 3 2~ nl3d which is bounded and 
is independent of t. In particular, the probability of the decoding 
complexity being Kd 3 is at most vftr nfid . The decoder is easy 
to implement and its performance is simulated in Section VIII. 
Note that the encoding complexity per time iteration increases 
linearly with time. This can also be made constant on average 
if the decoder can send periodic acks back to the encoder with 
the time index of the last correctly decoded source bit. 

V. Sufficient Conditions for Stabilizability 

Consider an unstable to— dimensional linear system whose 
state space equations in canonical form are given by (1), i.e., 
X(F) > 1, and recall that the characteristic polynomial of F is 
z n + aiz n ~ 1 + . . . + a m . Suppose the observer does not have 
any feedback from the controller, in particular, it does not have 
access to the control inputs. Then we can stabilize such a system 
in the mean squared sense over a noisy channel provided that 
the rate R and exponent j3 of the (R, j3)— anytime reliable code 
used to encode the measurements satisfy the following sufficient 
condition. 

Theorem 5.1 (No Feedback to the Observer): It is possible 
to stabilize (1) in the mean squared sense with an 
(R, (3)— anytime code provided (F,B) is controllable and 



R > R n 



1 



n 



i=l 



■ log 2 \{F) (8) 



If the observer knows the control inputs, it turns out that one 
can make do with lower rates. This is stated as the following 
Theorem 

Theorem 5.2 (Observer Knows the Control Inputs): When 
the observer has access to the control inputs, it is possible to 
stabilize (1) in the mean squared sense with an (R, (3)— anytime 
code provided (F, B) is controllable and 

R> R f n = argmin {\(FD nr ) < l} 

r 

P>tt = - log 2 X(F) 
n 

where D nr = diag (2r nr , !,...,!). Moreover 



(9a) 
(9b) 



R{ < - lo : 
n 



g 2 max^ \a m \2 



. max \ ai \2 l \ (10) 

Ki<m-1 ' 



The superscript / in i?£ denotes 'feedback' to emphasize the 
fact that the observer has access to the control inputs. Before 
proceeding further, we will give a brief outline of the proofs for 
Theorems 5.1 and 5.2 (details are in Section VII). At each time 
t, using the channel outputs received received till t, we bound 
the set of all possible states that are consistent with the estimates 
of the quantized measurements using a hypercuboid, i.e., a 



region of the form {x t E R m \x mm ^ t < x t < x max ^ t }, where 
x min,t\ti x max.t\t <= an d the inequalities are component- 
wise. If At| t = x maxAt - x mmAt , then from Lemma 7.1, 
A t+1 | t = FA t | t + Wl m . The anytime exponent is determined 
by the growth of A t in the absence of measurements, hence 
the bound j3 n = (3^ = 21og 2 X(F). The bound on the rate is 
determined by how fine the quantization needs to be for A t to 
be bounded asymptotically. 

A. The Limiting Case 

The sufficient conditions derived above are for the case when 
the measurements are encoded every time step. Alternately, one 
can encode the measurements every, say £, time steps, and 
consider the asymptotic rate and exponent needed as t grows. 
Note that this amounts to working with the system matrix F l . 
So, one can calculate this limiting rate and exponent by writing 
the eigen values of F, {Ai}™ l7 as Aj = /i" and letting n 
scale. The following asymptotic result allows us to compare 
the sufficient conditions above with those in the literature (eg., 
see [3], [5], [11]). 

Theorem 5.3 (The Limiting Case): Write the eigen values of 
F, {Ai}™ v in the form Aj = fif. Letting n scale, R n and R^ 
converge to R* , and j3 n and converge to B*, where 

R* = X! lo S2lMi|> P* = 21og 2 max|/Xi| (11) 

»:|Mil>l 

In addition, the upper bounds on R^ in (10) also converges to 
R*. 

Proof: See Section C of the Appendix. ■ 
For stabilizing plants over deterministic rate limited channels, 
[3] showed that a rate R > R* , where R* is as in (11), is neces- 
sary and sufficient. So, asymptotically the sufficient conditions 
for the rate R in Theorems 5.1 and 5.2 are tight. Though the 
above limiting case allows one to obtain a tight and an intuitively 
pleasing characterization of the rate and exponent needed, it 
should be noted that this may not be operationally practical. 
For, if one encodes the measurements every I time steps, even 
though Theorem 5.3 guarantees stability, the performance of the 
closed loop system (the LQR cost, say) may be unacceptably 
large because of the delay we incur. This is what motivated us to 
present the sufficient conditions in the form that we did above. 

B. A Comment on the Trade-off Between Rate and Exponent 

Once a set of rate-exponent pairs (R, B) that can stabilize 
a plant is available, one would want to identify the pair that 
optimizes a given cost function. Higher rates provide finer 
resolution of the measurements while larger exponents ensure 
that the controller's estimate of the plant does not drift away; 
however, we cannot have both. One can either coarsely quantize 
the measurements and protect the bits heavily or quantize them 
moderately finely and not protect the bits as much. One can 
easily cook up examples using an LQR cost function with the 
balance going either way. Studying this trade-off is integral to 
making the results practically applicable. 



VI. Tighter Bounds on the Anytime Exponent 

From Theorem 5.1, using the technique outlined in the previ- 
ous section, one needs an exponent nB > 2 log X(F). It turns out 
that a smaller exponent of 21og 2 X(F) suffices. The idea is to 
alternately bound the set of all possible states that are consistent 
with the estimates of the quantized measurements using an 
ellipsoid £(P,c) = {x e R m \(x - c, P~ 1 (x - c)) < l}. This 
can be seen as an extension of the technique proposed in 
[12] to filtering using quantized measurements. If m = 1, 
X(F) = X(F). So, let m > 2. 

In view of the duality between estimation and control, we can 
focus on the problem of tracking (1) over a noisy communica- 
tion channel. For, if (1) can be tracked with an asymptotically 
finite mean squared error and if (F, B) is stabilizable, then it is 
a simple exercise to see that there exists a control law {u t } 
that will stabilize the plant in the mean squared sense, i.e., 
limsup t E||x t || 2 < oo. In particular, if the control gain K is 
chosen such that y/2F + BK is stable, then u t — Kx t \ t will 
stabilize the plant, where x t \ t is the estimate of x t using channel 
outputs up to time t. Hence, in the rest of the analysis, we will 
focus on tracking (1). The control input u t therefore is assumed 
to be absent, i.e., u t = 0. 

We will first present a recursive state estimation algorithm 
using the channel outputs and then state the sufficient conditions 
needed for the estimation error to be appropriately bounded 
using such a filter. Recall that the channel outputs corresponding 
to the coded bits c t e GF£ are z t <G Z n . Let x e £{Pq, 0) and 
suppose using {z T } T < t -i, we have x t <E £(P t \ t -i, x t \t-i)- Note 
that, since H = [1,0,..., 0], the measurement update provides 
information of the form x^. ... < xi 1 ^ < x^ tU , which one 
may call a slab. £(P t \ t ,x t \ t ) would then be an ellipsoid that con- 
tains the intersection of the above slab with £(Pt\t-i, ^t|t-i)> 
in particular one can set it to be the minimum volume ellipsoid 
covering this intersection. Lemma A.l gives a formula for 
the minimum volume ellipsoid covering the intersection of an 
ellipsoid and a slab. Note that the width of the slab above tends 
to be smaller if the observer has access to the control inputs than 
when it does not. For the time update, it is easy to see that for 
any e > and P t+1 = (l + e)FP tlt F T + ^-l m , £(P t+ i, Fx t \ t ) 
contains the state x t+ \ whenever £(P t \ u x t \ t ) contains x t . This 
leads to the following Lemma. For convenience, we write P t 
for P t | t _!. 

Lemma 6.1 (The Ellipsoidal Filter): Whenever £(P ,0) 
contains x , for each e > 0, the following filtering equations 
give a sequence of ellipsoids {£ (P t \ t , x t \ t )} that, at each time 
t, contain x t . 

W 2 

P t+1 = (1 + e)FP t]t F T + —l m , x t+1 = Fx t]t (12a) 

Pt\t = o t Pt - (Pt - at )^r^ ' x t\t = & , T „ 

4 Ptei y/ei Pte! 

(12b) 

where a t , b t and £ t can be calculated in closed form using 
Lemma A.l. 

Using this approach, we get the following set of sufficient 
conditions. The proofs are similar to the proofs of Theorems 



5.1 and 5.2, and hence skipped due to space limitations. 

Theorem 6.2 (No Feedback to the Observer): It is possible 
to stabilize (1) for m > 2 in the mean squared sense with 
an (R, (3)— anytime code provided (F,B) is controllable and 



R > R e 



log 2 



E 

»=i 



8 > e , n = - log 2 \(F) 



(13a) 
(13b) 



where 6 = 

m—l 

Theorem 6.3 (Observer Knows the Control Inputs): When 
the observer has access to the control inputs, it is possible to 
stabilize (1) in the mean squared sense with an (R, 8)— anytime 
code provided (F, B) is controllable and 

R > RL 



rgmm{\(FD m , nr ) < l} 



/9># =-log 2 A(F) 



where D m , nr 
Moreover 

1 



dia g (^h2- n \Ve,...,Vo) , e 



Rl 



1 



log 2 max<^ \a m \(26) 



, max 2\a,\(29y 

Ki<ro-1 



(14a) 
(14b) 

m 
m—l * 

(15) 



In the same limiting sense as described in Section V, R{ n 
and R e , n converge to R* while fi[ n and 8 e _ n converge to 8*, 
where R* and 8* are as in the Lemma 5.3. The proof is in 
Section C of the Appendix. 



VII. Proofs of Theorems 5.1 and 5.2 

The analysis will proceed in two steps. We will first determine 
a sufficient condition on the number of bits per measurement, 
nR, that are required to track (1) when these bits are available 
error free. We will then determine the anytime exponent n/3 
needed in decoding these source bits when they are communi- 
cated over a noisy channel. 

At each time, we bound the set of all possible states that 
are consistent with the quantized measurements using a hyper- 
cuboid, i.e., a region of the form {x e W n \x m i n < x < x max }, 
where x min ,x max € K m and the inequalities are component- 



wise. In what follows, we call A t i T = x 



■A- 



■A 



., the 



uncertainty in x t using {b' T } T '< r , i.e., quantized measurements 
up to time r. For convenience, let A t = A t | t _i. Then, the time 
update is given by the following Lemma. 

Lemma 7.1 (Time Update): The time update relating A t +\ 
and A t | t is given by A t+1 = FA t | t + Wl m 

Proof: From the system dynamics in (1), the following is 
immediate 



W + max{|±a i A$+A<f t +1) 



1 t+l — " -r^a^^|-i-u,ti_i t | t 

= klA<V + A. (i+1) 



v (i+l) 



a, A 



(i) 
t\t 



A t|t 



1 t|t 



+ W. i<m-l 



(m) 
*t+l 



\<h, 



In short, the above equations amount to A t+1 = FA t i t + Wl m . 

m 

The measurement update depends on whether or not the 
observer has access to the control inputs. 

A. Observer does not know the control inputs 

In this case, the observer simply quantizes the measurements 
y t according to a 2 nR — regular lattice quantizer with bin width 
S, i.e., the quantizer is defined by Q : R i-)- {0, 1, . . . , 2 nR - 1}, 
where Q(x) = |_f J mod 2 nR . Assuming that the rate, R, is 
large enough, we will first find the steady state value of the 
recursion for A t , which we then use to determine R. At each 
time t, the observer can communicate the measurement y t to 
within an uncertainty of S, i.e., the estimator knows that the 
measurement lies in an interval of width 5. Adding to this the 
effect of the observation noise, — \ < v t < \, the estimator 
knows x^ to within an uncertainty of A$ = S + V. Note that 



A 



,(') 



t | t — u ( for i ^ 1. Combining this observation with Lemma 
7.1, the following is fairly straightforward. 

Lemma 7.2 (Steady State value of A t without feedback): If 
lim^co A t = Aoo, then A^ = (5 + V)L u a + WL u l m , where 
a = [\ai\,. . . , \a m \] T and L u = [tij]i<i,j< m with i i3 = Ii<j. 

Now, we need to go back and calculate R. Observe that 
Aoo does not depend on the starting value A . So we just 
need 62 nR > max jA^, Aq 1 -* j + V. From the above Lemma, 

= 8YZi \ai\ + VJ2T=i \a t \+mW. So, we need 



2 nR > max ■ 



V + VZZi\ a i\+ mW A o 



(1) 



The minimum required rate is obtained by letting 5 — > oo, in 
which case we need R > ^ log 2 Y^L 1 \(h\ and this gives R n 
in Theorem 5.1. 

B. Observer knows the control inputs 

In this case, the observer can infer that the uncertainty in 
y t at the estimator side is A^ 1 ' + V. So, it can use the nR 
bits to shrink this to 2~ nR (A i ). + V). Taking into account the 
observation noise, the uncertainty in x t after the measurement 



(i) 



update will be given by A t , t 
— A (*) f™- „• a. 1 rv 



V 



V) and 



A^u = A^ for i ^ 1. Combining this with Lemma 7.1, the 



overall recursion for A t is given by 

A t+ i = FD nR A t + W c . nRl where 
D nR = diag{2-"«, 1, . . . , 1} 
W c , nR = [V(l + 2- nR ) + W,W,...,W] T (16) 

Noting that V(l + 2~ nR ) < 2V, the above recursion is bounded 
if and only if FD nR is stable. It follows that FD nR is stable 
for all R > Rl, where R f n = \ argmin r {A (FD nr ) < l}. 

Now consider tracking (1) over a noisy channel. Intuitively, 
the desired anytime exponent is determined only by the growth 
of the tracking error in the absence of measurements, which by 
Lemma 7.1 is governed by F. This is independent of whether 
or not control input is available at the observer. This explains 
the value of /3 n — — ^ log 2 \{F) in Theorems 5.1 and 




(a) Open loop trajectory (b) Trajectory after closing 
the loop 

Fig. 2. 
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Fig. 3. The control performance of the code ensemble improves as the rate 
decreases 

5.2. Making this argument rigorous is simple and has not been 
presented here due to space limitations. 

VIII. Simulations 

We present two examples, one scalar and one vector, and 
stabilize them over a binary erasure channel with erasure 
probability e = 0.3. The number of channel uses per mea- 
surement is fixed to n = 15. In both cases, time invariant 
codes Hi 5 jv € TZi, for an appropriate rate R, were randomly 
generated and decoded using Algorithm 1. 

A. Example 1 

Consider stabilizing the scalar unstable process obtained by 
setting m = 1, — a± = 2 in (1) with wt and w t being uniform on 
[—30,30] and [—1, 1] respectively. Using Theorem 5.1, inorder 
to stabilize x t in the first moment sense, one needs a code 
with exponent j3 > - = 0.0667 and k — nR > 1. Using 
Theorem 3.2, causal linear codes exist for j3 < f3* = i? _1 (l — 
R) (log 2 (j-J + log 2 (2 1_fl — l)^. A quick calculation shows 

that for k = 6,n = 15, /3* = = 0.0761 > 0.0667. 

The observer does not have access to the control inputs, so an 
s fc — regular lattice quantizer with bin-width S was used to quan- 
tize the measurements. The control input is just u t — — x t \ t _ 1 . 
The four curves in Fig 3 correspond to the following sets of 
values: (k = 3, 8 = 16), (fc = 4, 8 = 8), (k = 5, 8 = 4) and 
(k = 6, 8 = 2). Fig 2 shows the plot of a sample path of the 
above process with k — 3, 8 — 16, L — 2 3 before and after 
closing the loop, the fact that the plant has been stabilized is 
clear. By easing up on the rate R, i.e., by performing coarser 
quantization but better error correction, the control performance 
of the code ensemble improves. This is demonstrated in Fig 
3. For each value of k from 3 to 6, 1000 time invariant 
codes were generated at random from TZi. Each such code 

° 2 

was used to control the process above over a time horizon of 




fraction of codes °" 7 



Fig. 4. The CDF of the LQR costs for different realizations of the codes 

T = 100. The X— axis denotes the proportion of codes for 
which sup (<100 E|xd is below a prescribed value, e.g., with 
k = 6,n = 15, sup f<100 E|x( | was less than 200 for 50% of 
the codes while with k = 3, n = 15, this fraction increases to 
more than 95%. The y— axis has been capped at 1000 for clarity. 
This shows that one can tradeoff utilization of communication 
resources and control performance. 

B. Example 2 

Consider a 3-dimensional unstable system (1) with a% = —2, 
a2 = —0.25, a3 = 0.5 and B = 1$. Each component of wt 
and vt is generated i.i.d N(0, 1) and truncated to [-2.5,2.5]. 
The eigen values of F are {2, -0.5, 0.5} while \(F) = 2.215. 
The observer has access to the control inputs and we use the 
hypercuboidal filter outlined in Section VII. Using Theorem 
5.2, the minimum required bits and exponent are given by 
k = nR > 2 and n/3 > 2 log 2 2.215 = 2.29. The control input is 
ut — —£ t \t-i- For k < 5, n(3 > 2.53. The competition between 
the rate and the exponent in determining the LQR cost is evident 
when we look at the LQR cost 7^ *£]=i E [INI 2 + ll u tll 2 ] in 
Fig 4. When k = 2, the error exponent nj3 = 6.3 is large. 
So, at any time t, the decoder decodes all the source bits 
{6 T } T <i_i with a high probability. Hence, the limiting factor 
on the LQR cost is the resolution the source bits bt provide 
on the measurements. But when k = 5, the measurements 
are available almost losslessly but the decoder makes errors in 
decoding the source bits. Fig 4 suggest that the best choice of 
rate is R = 3/15 = 0.2. 

IX. Conclusion 

We presented an explicit construction of anytime reliable 
tree codes with efficient encoding and decoding over erasure 
channels. We also gave several sufficient conditions on the rate 
and reliability required of the tree code to guarantee stability, 
and argued that they are asymptotically tight. Although the work 
described here is a major step towards controlling plants over 
noisy channels, there are many issues to study and resolve. The 
tradeoff between rate and reliability (how finely to quantize 
the measurements vs. how much error protection to use) to 
optimize system performance (such as an LQR cost) remains to 
be studied, as well as how best to quantize and generate control 
signals. Furthermore, the problem of constructing efficiently 
decodable tree codes for other classes of channels, such as the 
BSC and the AWGNC, remains open. 
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Appendix 

A. The Minimum Volume Ellipsoid 

Lemma A.l (Theorem 6.1 [13]): The minimum volume el- 
lipsoid £(P,c) covering 

{.t g R m \x G £(P,0),yVh T Ph < (h,x) < 5Vh T Ph} 

where \6\ > |7|, is given by 

Phh T P Ph 
P = bP-(b-a)^-±-, c = £^J= (17) 
h T Ph WWPh 

where 

1) If 7(5 < then £ = 0, a = b = 1 

2) If 7 + S = and jS > then 



£ = 0, a = m,5 , b 



2 ^ m(l-<5 2 ) 



m — 1 



3) If 7 + 5 ^ and j6 > then 

m( 7 + <5) 2 + 2(1 + 7(5) - yfD 



2(m + l)( 7 + (5) 



a = m(£- 7 )(£-<5), b - 



a — ay 



where D = m 2 (<5 2 - 7 2 ) 2 + 4(1 - 7 2 )(1 - <5 2 ) 



If |<5| < |7|, change x to —x and apply the above result. 

B. Upper bounds on and W e n 

There are several bounds in the Mathematics literature on the 
roots of a polynomial in terms of the polynomial coefficients, 
a standard and near optimal bound being the Fujiwara's bound 
which we state below. 

Lemma A.2 (Fujiwara's Bound): Consider the monic poly- 
nomial with complex coefficients f(x) = x m +c 1 x m ~ 1 + . . . + 
c m and let A(/) denote the largest root in magnitude. Then 



A(/) <#(/) =2max<n Cl |,| C2 



The upper bounds on and R{ n can now be proved as 
follows. The characteristic polynomial of FD nr is given by 
fcnr(x) =x m - 2- nr YZi Hx m - 1 . Applying Lemma A.2, 
if the rate R is larger than the smallest value of r that will make 
K(fc, nr ) < 1, then \{FD nR ) < K{f c , nR ) < 1 making FD nR 
stable. The bound for R^ is then immediate while the bound 
for R{ n follows by noting that the characteristic polynomial of 

FP > m,nr is X m - ^2~ nr ^Zl ^N*™"^ 

C. The Limiting Case 

Let F is any m-dimensional square matrix and f(x) denotes 
its characteristic polynomial. Then the following bounds hold 
(for details see [14]) 

\{F) 



\(F) < \(F) < 



K(f) < 2X(F) 



(18) 



y/2- i 

The proof for lim^oo f3£ — (3* follows easily from the first 
bound in (18). 

By the hypothesis of the Lemma, the eigen values of F n are 
of the form To emphasize the fact that F depends on 

n, we write it as F n and at as a^„. Recall that the characteristic 
polynomial of F n is given by f n (x) = x' m + a\_ n x m ~ Y + . . . + 
a m ^ n . Let I u = {i \ > 1}, then the following is easy to 
prove 



n^oo |a|x„|,n| 



0, i ^ |I„|, lim - log 2 \a\i 1 1 

n— >oo n 



log 2 
(19) 



We will prove that R n and R^ converge to R*, the proof 
for R e ^ n and R{ n is similar. From (19), it is obvious that 
linin^oo R n = ^2 ieX 1°S2 I A*i I ■ ^ remains to show that 
the limit holds for The characteristic polynomial of 

FD nr is given by / c , nr (x) = x m -_2~ nr YZ 1 \ai\x m -\ 
From (C), we have §A"(/c»r) < A(F7J„ r ) < K(f c , nr )). 



Define R J n l = argmin,. { \K(f Ct7lr ) < 1} 



and R s n 2 



f A 

71,1 ~ 

argmm r 

{K{fc,nr) < !}• Then, it is obvious that R f nl <R f n < 
Rn 2 - Using Lemma A.2, some simple algebra and taking limit 

n — > 00, we get 



1 



lim = lim — log 2 max 



a„ 



max 

Ki<m-1 



a 1 



Combining this with (19), we get the desired result, i.e., 



lim„ 



• Rf n = EieI„ lo g2lMi|- 



